Sfoglia per Rivista MACHINE LEARNING
Lowest probability mass neighbour algorithms: relaxing the metric constraint in distance-based neighbourhood algorithms
2019-01-01 Ting, K. M.; Zhu, Y.; Carman, M.; Zhu, Y.; Washio, T.; Zhou, Z. -H.
Policy gradient in Lipschitz Markov Decision Processes
2015-01-01 Pirotta, Matteo; Restelli, Marcello; Bascetta, Luca
Policy space identification in configurable environments
2021-01-01 Metelli, A. M.; Manneschi, G.; Restelli, M.
Smoothing policies and safe policy gradients
2022-01-01 Papini, M.; Pirotta, M.; Restelli, M.
Titolo | Data di pubblicazione | Autori | File |
---|---|---|---|
Lowest probability mass neighbour algorithms: relaxing the metric constraint in distance-based neighbourhood algorithms | 1-gen-2019 | Carman M. + | |
Policy gradient in Lipschitz Markov Decision Processes | 1-gen-2015 | PIROTTA, MATTEORESTELLI, MARCELLOBASCETTA, LUCA | |
Policy space identification in configurable environments | 1-gen-2021 | Metelli A. M.Restelli M. + | |
Smoothing policies and safe policy gradients | 1-gen-2022 | Papini M.Pirotta M.Restelli M. |
Legenda icone
- file ad accesso aperto
- file disponibili sulla rete interna
- file disponibili agli utenti autorizzati
- file disponibili solo agli amministratori
- file sotto embargo
- nessun file disponibile